Dual Reinforcement Q - Routing : an on - Line Adaptive Routing Algorithm 1
نویسندگان
چکیده
This paper describes and evaluates the Dual Reinforcement Q-Routing algorithm (DRQ-Routing) for adaptive packet routing in communication networks. Each node in the network has a routing decision maker that adapts, on-line, to learn routing policies that can sustain high network loads and have low average packet delivery time. These decision makers learn based on the information they get back from their neighboring nodes as they send packets to them (forward exploration similar to Q-Routing) and the information appended to the packets they receive from their neighboring nodes (backward exploration unique to DRQ-Routing). Experiments over several network topologies have shown that at low loads, DRQ-Routing learns the optimal policy more than twice as fast as Q-Routing, and at high loads, it learns routing policies that are more than twice as good as Q-Routing in terms of average packet delivery time. Further, DRQ-Routing is able to sustain higher network loads than Q-Routing and non-adaptive shortest-path routing.
منابع مشابه
Conndence Based Dual Reinforcement Q-routing: an Adaptive Online Network Routing Algorithm
This paper describes and evaluates the Conndence-based Dual Reinforcement Q-Routing algorithm (CDRQ-Routing) for adap-tive packet routing in communication networks. CDRQ-Routing is based on an application of the Q-learning framework to network routing, as rst proposed by Littman and Boyan (1993). The main contribution of CDRQ-routing is an increased quantity and an improved quality of explorati...
متن کاملDual Reinforcement Q - Routing : an on - Lineadaptive Routing Algorithm
This paper describes and evaluates the Dual Reinforcement Q-Routing algorithm (DRQ-Routing) for adaptive packet routing in communication networks. Each node in the network has a routing decision maker that adapts, on-line, to learn routing policies that can sustain high network loads and have low average packet delivery time. These decision makers learn based on the information they get back fr...
متن کاملConfidence Based Dual Reinforcement Q-Routing: An adaptive online network routing algorithm
This paper describes and evaluates the Confidence-based Dual Reinforcement Q-Routing algorithm (CDRQ-Routing) for adaptive packet routing in communication networks. CDRQ-Routing is based on the Qlearning framework of Q-Routing. The main contribution of this work is the increased quantity and improved quality of exploration in CDRQ-Routing, which lead to faster adaptation and better routing poli...
متن کاملComparison of the Q-Routing and Shortest Path Routing Algorithms
In this paper, we compare the self-adaptive QRouting and dual reinforcement Q-Routing algorithms with the conventional shortest path routing algorithm. The Q-Routing algorithm embeds a learning policy at every node to adapt itself to the changing network conditions, which leads to a synchronised routing information, in order to achieve a shortest delivery time. Unlike Q-Routing, the shortest pa...
متن کاملCS 229 Final Report: Location Based Adaptive Routing Protocol(LBAR) using Reinforcement Learning
In this paper we present an algorithm for a location based adaptive routing protocol that uses both geographic routing and reinforcement learning to maximize throughput in our mobile vehicle network. We use reinforcement learning to determine the correct direction to forward a packet and then use geographic routing to forward a packet toward the network sink. We use an extension of the q-routin...
متن کامل